A De-identification Method for Bilingual Clinical Texts of Various Note Types

نویسندگان

  • Soo-Yong Shin
  • Yu Rang Park
  • Yongdon Shin
  • Hyo Joung Choi
  • Jihyun Park
  • Yongman Lyu
  • Moo-Song Lee
  • Chang-Min Choi
  • Woo-Sung Kim
  • Jae Ho Lee
چکیده

De-identification of personal health information is essential in order not to require written patient informed consent. Previous de-identification methods were proposed using natural language processing technology in order to remove the identifiers in clinical narrative text, although these methods only focused on narrative text written in English. In this study, we propose a regular expression-based de-identification method used to address bilingual clinical records written in Korean and English. To develop and validate regular expression rules, we obtained training and validation datasets composed of 6,039 clinical notes of 20 types and 5,000 notes of 33 types, respectively. Fifteen regular expression rules were constructed using the development dataset and those rules achieved 99.87% precision and 96.25% recall for the validation dataset. Our de-identification method successfully removed the identifiers in diverse types of bilingual clinical narrative texts. This method will thus assist physicians to more easily perform retrospective research.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Intercultural Competence Formation: Teaching Reading of Profession-Related Texts in a Foreign Language to Agricultural Bilingual Students

The paper deals with the features of teaching of profession-related texts reading in a foreign language to bilingual students in agricultural higher education institution. Article’s purpose was to analyze the technology of intercultural competence formation by means of profession-related texts reading. The method of intercultural competence formation included using the profession-related texts ...

متن کامل

EFL Teachers’ Corrective Feedback and Students’ Revision in a Peruvian University: A descriptive study

This study explored the EFL teachers’ written corrective feedback (CF) techniques and their EFL students’ ability to integrate the CF while revising their texts. A total of 72 EFL students and 4 EFL teachers participated in this study. The data were collected through explicitation interviews administered to teachers and students, as well as through students’ written productions. A content analy...

متن کامل

High-Performance Bilingual Text Alignment Using Statistical and Dictionary Information

This paper describes an accurate and robust text alignment system for structurally different languages. Among structurally different languages such as Japanese and English, there is a limitation on the amount of word correspondences that can be statistically acquired. The proposed method makes use of two kinds of word correspondences in aligning bilingual texts. One is a bilingual dictionary of...

متن کامل

EFL Teachers’ Corrective Feedback and Students’ Revision in a Peruvian University: A descriptive study

This study explored the EFL teachers’ written corrective feedback (CF) techniques and their EFL students’ ability to integrate the CF while revising their texts. A total of 72 EFL students and 4 EFL teachers participated in this study. The data were collected through explicitation interviews administered to teachers and students, as well as through students’ written productions. A content analy...

متن کامل

Extraction of Training Sets for Experimentation with Cross Language Information Retrieval Systems

In this paper we focus on methods, models and tools for the extraction of bilingual training / test sets useful for the (semi) automatic classification of textual documents. Such documents could be tutorials, technical specifications, articles, personal notes, etc. Another motivation for our research is the need for managing corpus of classified texts and especially parallel corpora (texts). We...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره 30  شماره 

صفحات  -

تاریخ انتشار 2015